Master ’ s Thesis : Application of Fuzzy Classification to a Data
نویسنده
چکیده
The ability to analyze large amounts of data for the extraction of valuable information presents a competitive advantage for any organization. The technologies of data warehousing, OLAP, and data classification support that ability. The data warehouse is a central data pool which integrates heterogeneous data sources and provides strategic information for analysis and decision support. OnLine Analytical Processing (OLAP) presents an approach to data analysis where data is consolidated and aggregated with respect to multiple dimensions of interest. The idea is to consolidate large amounts of data by summarizing and aggregating data elements for every cell of a data cube. Classification of data elements reduces an arbitrarily high number of data elements into an arbitrarily small set of classes, which highly reduces the granularity of data. In OLAP, classification is used for the consolidation of dimensional attributes. Fuzzy classification is achieved by modeling data element classes with fuzzy sets. Thus, data elements are assigned to one or many classes to a certain degree. This thesis researches the application of fuzzy classification to OLAP data analysis. Specifically, the approach is the fuzzification of dimension classification in OLAP cubes. A framework has been developed, which allows the description of data cubes with fuzzy dimension hierarchies. This framework has been implemented in a prototype computer program capable of calculating and manipulating OLAP cubes with fuzzy dimension classification. This prototype allows the definition of fuzzy contexts on dimensional attributes, as well as the definition of OLAP cubes on base relations. The levels of consolidation for every dimension can be drilled down, rolled up, sliced and diced. Data consolidation can be achieved by crisp or fuzzy classification.
منابع مشابه
Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملAdaptive Neuro-Fuzzy Inference System application for hydrothermal alteration mapping using ASTER data
The main problem associated with the traditional approach to image classification for the mapping of hydrothermal alteration is that materials not associated with hydrothermal alteration may be erroneously classified as hydrothermally altered due to the similar spectral properties of altered and unaltered minerals. The major objective of this paper is to investigate the potential of a neuro-fuz...
متن کاملEvaluation of geomorphology method application for flood Hazards risk classification using Fuzzy Logic (Case study: Ojan Chay drainage basin)
Past decades damage by floods in Iran and on the other of the world has shown that we have still much work to do to cope with this problem. Hence, the study of these events and development of more effective adaptation and mitigation policies has become a priority, in other parts of the globe. First step in achieving flood risk assessment is data collection. Availability, suitability and quality...
متن کاملApplication of Artificial Neural Networks in a Two-step Classification for Acute Lymphocytic Leukemia Diagnosis by Blood Lamella Images
Introduction: This study aimed to present a system based on intelligent models that can enhance the accuracy of diagnostic systems for acute leukemia. The three parts including preprocessing, feature extraction, and classification network are considered as associated series of actions. Therefore, any dysfunction or poor accuracy in each part might lead in general dysfunction of...
متن کاملClassification of Streaming Fuzzy DEA Using Self-Organizing Map
The classification of fuzzy data is considered as the most challenging areas of data analysis and the complexity of the procedures has been obstacle to the development of new methods for fuzzy data analysis. However, there are significant advances in modeling systems in which fuzzy data are available in the field of mathematical programming. In order to exploit the results of the researches on ...
متن کامل